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Humans constantly adjust their social relationships and choose new partners of good reputations, thereby 
promoting the evolution of cooperation. Individuals have to pay a cost to build a reputation, obtain others' 
information and then make partnership adjustments, yet the conditions under which such costly behaviors 
are able to evolve remain to be explored. In this model, I assume that individuals have to pay a cost to adjust 
their partnerships. Furthermore, whether an individual can adjust his partnership based on reputation is 
determined by his strategic preference, which is updated via coevolution. Using the metaphor of a public 
goods game where the collective benefit is shared among all members of a group, the coupling dynamics of 
cooperation and partnership adjustment were numerically simulated. Partner- switching behavior cannot 
evolve in a public goods game with a low amplification factor. However, such an effect can be exempted by 
raising the productivity of public goods or the frequency of partnership adjustment. Moreover, costly 
partner-switching behavior is remarkably promoted by the condition that the mechanism of reputation 
evaluation considers its prosociality. A mechanism of reputation evaluation that praises both cooperative 
and partner-switching behaviors allows them to coevolve. 

The prevalence of cooperative or altruistic behaviors among individuals underpins human societies, giving 
rise to a marvelous complexity of societal and economic organizations^ Human individuals who are 
narrowly self-interested to maximize their personal welfare are termed as homo economicus. Although 
selfishness is a rational choice, a wide range of natural and experimental settings have demonstrated the ubiquity 
of irrational individuals who are willing to improve the welfare of others, the so-called homo reciprocans^. This 
phenomenon attracts extensive attention from evolutionary biologists, psychologists, economists, etc. In a simple 
game-theoretical model that characterizes the mutual interplay between individuals, two strategic options are 
available: defection (D) and cooperation (C). When a population of individuals playing such a game is exposed to 
natural selection or individuals autonomously imitate fitter ones, cooperation becomes an evolutionarily dis- 
advantaged strategy. Therefore, additional mechanisms are required to sustain the evolution of cooperation. 
Various mechanisms with the potential to meet this function can be grouped into several categories^. 

Reputation mechanisms play a crucial role in human cooperation. Increased cognitive capacity allowed 
recently evolved primates to track reputations in a way that other species could not. For example, only chim- 
panzees and humans exhibit the capacity to retain information regarding third-party interactions^. Interestingly, 
concern for self- reputation is observed in human children rather than chimpanzees^. Humans are often indirectly 
reciprocal, i.e., I help you and someone else helps me, which requires a building of reputation^. Even among 
repeated encounters, reputation is vital to establish the cooperative order. An individual's reputation reflects his 
willingness to cooperate with others. Notably, some second-order altruistic behaviors can create the reputation 
effect as well, i.e., altruistic punishment^°. Although an altruistic individual suffers the cost of helping others, he 
will most likely be recognized and rewarded by other counterparts in the future. In the long run, cooperative 
behaviors as well as a system of reputation evaluation and tracking can be fostered. Gossip or rumor may 
contribute to achieving a similar purpose ^\ 

In indirect reciprocity models, the central issue is how to evaluate and quantify an individual's behaviors. By 
considering different levels of information, including the donors' behaviors, the receivers' reputations and the 
donors' reputations, reputation can be evaluated delicately^'^^"^^. Individuals can take various strategies against 
others according to their reputations. The evolution of indirect reciprocity depends on praising altruistic beha- 
viors and suppressing selfish behaviors. Rockenbach and Milinski found in a human experiment that the 
frequency of costly punishment acts was reduced to a low level in the presence of indirect reciprocity^^. A 
theoretical study showed that there is only a small parameter region where costly punishment is more efficient 
than strategies of indirect reciprocity^^. An empirical study conducted in a Peruvian highland community 
indicates that persons who contributed more to the collective action had better reputations for various qualities. 



SCIENTIFIC REPORT; | 4 : 5957 | DOI: 1 0. 1 038/srep05957 



1 



Larger support networks and healthier states distinguished house- 
holds of greater reputations^^. However, the cost of reputation build- 
ing and the observability of reputation information are the factors 
that condition the success of indirect reciprocity^^'^^. A recent experi- 
ment produced the interesting result that money can serve as a sym- 
bol of trust among strangers, i.e., an analog of reputation^". In their 
experiment, subjects gave worthless tokens to reward others' help 
and demanded a token in exchange for help. Trust among strangers 
could be developed as the social groups grew larger and tokens 
endogenously became money. 

With the aid of reputation, cooperation can be enhanced in net- 
work reciprocity models. When interaction patterns among indivi- 
duals are structured as a network, the equilibrium frequency of 
cooperation in the whole population exhibits a second- order phase 
transition as the benefit-to- cost ratio increases^^'^^. Such a mech- 
anism is referred to as network reciprocity, and such games played 
by networked individuals are referred to as spatial games or net- 
worked games. Researchers have found that the complement of repu- 
tation mechanisms favors cooperation in network reciprocity 
models. Brandt et al studied the effect of reputation in spatial public 
goods games (PGGs)^^. In the presence of reputation, highly coop- 
erative and fair outcomes are achieved. Wang et al proposed a part- 
ner selection rule on the basis of evaluating both reputations of a pair 
in a spatial PGG^'*. If two adjacent players' reputations are within the 
tolerance range, the physical connection can be transformed into an 
interaction relationship. In this setting, a moderate tolerance range of 
reputation results in the best environments for cooperation. 

In the real world, human social networks are relatively static at the 
macroscopic scale, whereas individuals frequently adjust their social 
relationships at the local level. Therefore, partnership dynamics must 
be incorporated into spatial game models. Zimmermann et al. pro- 
posed a model where a spatial game is played by agents and the 
network of interaction links evolves to adapt to the outcome of the 
game^^ A cooperator can dismiss an interaction with a defector and 
switch to another randomly chosen agent. This activity triggers the 
emergence of highly connected nodes and a highly cooperative sta- 
tionary state. More recently, the coevolution of cooperation and 
partnership networks has led to a considerable amount of experi- 
mental and theoretical investigation^^"^^. 

Fu et al. introduced the mechanism of reputation-based partner 
choice into the spatial prisoner's dilemma game^^. Individuals can 
either change their strategies by imitating their partners or adjust 
their partnerships based on local information regarding reputations. 
Such a mechanism of partner switching based on reputation brings 
about a significant promotion of cooperation with the evolution of a 
heterogeneous partnership network. When individuals play PGGs, 
the stationary density of cooperators depends on the group size, and 
reputation-based partner selection can improve cooperation remark- 
ably^^. In an experimental setting, Sylwester and Roberts showed that 
cooperators benefit from these reputation-based partner- switching 
behaviors^^. Even if agents are restricted to adjusting within their 
geographical neighborhood, the reputation-based partner choice 
promotes cooperation efficiently^^. It is worth pointing out that uni- 
lateral reputation-based partner switching in a non- excludable PGG 
cannot lead to the promotion of cooperation^". 

The puzzle of rampant cooperation in humans lies in the fact that a 
selfish individual helps another at a cost to itself Cost is also the 
byproduct of other prosocial behaviors, e.g., costly punishment^\ 
Recently, Suzuki and Kimura reported that indirect reciprocity is 
sensitive to the costs of information transfer. Their work introduced 
the hypothesis that information transfer (or reputation building) is 
costly into indirect reciprocity models, making indirect reciprocal 
behaviors as costly as cooperative behaviors. Although indirect reci- 
procity helps explain cooperation among unrelated strangers, cost 
might be a handicap. To the best of my knowledge, the effects of the 
cost of reputation-based partner switching on cooperation have sel- 



dom been considered in previous partner-switching models. The 
present model extends previous ones primarily by considering this 
cost of partner-switching behaviors. 

In different contexts, social dilemmas frequently involve a group 
of individuals. For example, say a number of players maintain a 
public good from which all members benefit unconditionally. In such 
a situation, a homo economicus should choose the defection strategy. 
Referred to as partnership adjustment in multiple-person games, a 
player chooses a group to join. In most spatial models of PGG, each 
agent hosts a game and takes part in games hosted by others. The 
group selection problem de facto becomes the individual selection 
problem. A homo economicus is likely to choose an individual who 
can maximize his personal welfare, whereas a homo reciprocans most 
likely chooses another counterpart. If the partner- switching beha- 
viors are costly, the evolution of such costly behaviors becomes as 
problematic as the evolution of costly cooperative behaviors. In this 
model, I do not presume that the partner- switching behavior of an 
individual is unchanging during the game's evolution. Partner- 
switching behavior can be imitated or learned in the same way as 
game-strategic behavior. Using numerical simulations, I show that a 
mechanism of reputation evaluation that praises both cooperative 
and partner-switching behaviors allows them to coevolve. 

Model 

Strategic behaviors. The present model describes the evolution of 
genetically transmitted behavioral types in a population of semi- 
rational or rational agents who engage in game interactions and 
partnership adjustments. There are two classes of individual beha- 
viors regarding partner switching and game playing. The particular 
behavior of an agent is determined by his preferences. Two opposite 
preferences, i.e., C and D, condition agents' behavioral choices in 
game playing. Agents can adjust their partnerships under a reputation- 
maximization rule (R) or do nothing (N). Such a partner- switching 
rule enables an agent to switch from the lowest- reputation neighbor 
to the highest-reputation one among their next-nearest neighbors. 
Therefore, agents' behaviors are represented as the expression of two 
hypothetical alleles at each of two loci. CR agents cooperate in the 
period of game playing and adjust their partnerships based upon 
reputation during partner switching. In the same manner, the 
behaviors of DR, CN, and DN agents are determined. For the sake 
of mathematical expression, agent is strategy S/ is denoted as a two- 
dimensional binary vector [sn, S/2]. The first dimension expresses the 
cooperative preference, and the second dimension expresses the 
preference for partner switching. To be more specific, S/i = 1 
means that agent / is cooperative, whereas Sn = 0 means that 
agent / exhibits the opposite preference. For partner switching, S/2 
= 1 means that agent / is willing to adjust his partnership if necessary 
and pay the cost associated with such a behavior, whereas S/2 = 0 
means that agent / never makes any adjustment. 

Game interactions. Agents autonomously organize into a number of 
groups in which members in the same group play a game together. 
Their interaction patterns, i.e., who interacts with whom, are defined 
by a dynamical network. Agents are placed on the nodes, which are 
connected by links. Each agent can sponsor a game in which his 
linked neighbors can participate. In a game sponsored by agent /, 
agent j obtains a payoff (/) if he has a link to / and thus plays as a 
group member. The evolutionary fitness of an agent is the 
accumulated payoffs obtained from all the games that he is 
involved in, i.e., 

m=psXi)+J2p^Xj\ (1) 

where J\f] is the set of linked neighbors of agent /. 
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I resort to a PGG as a metaphor to model the interactions among 
group members. PGGs describe a situation where the collective bene- 
fit is shared among the members in the group. In a typical PGG, 
cooperators contribute a fixed amount of effort c to the common 
pool. The amount of the overall contributions is amplified by a factor 
a and shared among all group members. In a PGG sponsored by 
agent /, agent j obtains a payoff as a group member according to the 
following: 



Psjii) 



Ginc{i)c 
Ginc{i)c 

KI+1 



if Sji=0, 



else. 



(2) 



In this function, ndi) is defined as the number of cooperative agents 
in group /. If agent j is a cooperator, i.e., Sji = 1, then ndi) is increased 
by 1. \j\f^ I is the number of agent is linked neighbors. 

Coevolutionary dynamics. Agents engage in the two coupled 
evolutionary processes of partnership adjustment and strategy 
update. At each time step of the coevolution, a random agent 
adjusts his partnership with probability Pr or otherwise updates his 
strategy by imitation. The dynamics of strategy update is inherited 
directly from previous models; namely, smoothed imitation 
dynamics is adopted^^'^^. The probability that agent / successfully 
adopts the strategy of one of his neighbor j is 



i+exp[-(f{j)-f{i))/ey 



(3) 



where agent; (jeAf]) is chosen randomly and 6 is the noise extent. 
The two alleles of j are transmitted to / as a whole if the imitation 
occurs; thereafter, S/i = Sji and S/2 = Sj2. 

Partner-switching behaviors. In agreement with previous studies, 
individuals adjust their partnerships by choosing the candidate with 
the maximal reputation. The minimum information required for a 
partner- switching agent is the accurate reputations of his nearest 
neighbors and next-nearest neighbors. The renewal of each agent's 
reputation score occurs at every time step according to the following 
equation^^: 

ki{t)=lki{t-l)+Siu (4) 

where 1 is the discounting rate of reputation. The score of an 
individual's reputation is his previous score plus the performance 
of the game-strategic behavior in the current time step. 

This model introduces four strategies regarding cooperation and 
partnership adjustment. I propose an improved reputation evalu- 
ation function that further values partner-switching behavior. As a 
simple improvement on Eq. 4, an individual's game-strategic and 
partner- switching behaviors can be weighted equally: 

ki{t) = Xki{t-l)+Sn+Si2. (5) 

When agent / (s/2 = 1) modifies his partnership, he severs the neigh- 
bor who has the minimum reputation among all his nearest neigh- 
bors: arg mm{kj{t)) {jej\fi). Then, he chooses as a new partner the 

j 

individual who has the maximum reputation among all his next- 
nearest neighbors: arg max(A;j(t)) (jeM^). In a model of friendship 

formation^^ an agent incurs an opportunity cost regardless of the 
outcome for each unit of time in the matching process. In this model, 
the severed agent is the one with the minimum reputation, who may 
play as an exploiter in the PGG hosted by the focal agent. Therefore, 
the focal agent has an incentive to adjust the partnership even at a 
cost to himself Moreover, a reputation system incurs more costs, e.g., 
the cost of information dissemination and maintenance and the cost 



of reputation evaluation. I assume that these costs are shared by all 
the agents involved in this process. Specifically, each agent who 
exhibits the R trait has to pay a certain cost Cr. 



/'(i)=/(0-C„ if 5,2 = 1. 



(6) 



In a reputation system where game-strategic behaviors are consid- 
ered, the second- order assessment also depends on the score of the 
receiver^; for example, it can be deemed bad to help a bad person. The 
partner- switching activity defined here will possibly increase the 
overall welfare of the public goods hosted by the focal agent. The R 
trait of an agent also implies that he accepts his responsibility to share 
the cost during the partner- switching process. For others, or at least 
linked neighbors, the partner- switching behavior of the focal agent 
can be deemed a prosocial behavior. It is therefore assumed that a 
partner- switching behavior raises the focal agent's reputation, which 
is conveyed in Eq. 5. 

Results 

I conducted numerical simulations using a wide range of parameters. 
To quantify the equilibrium frequencies of strategies, the average 
fraction of a strategy was sampled during the last 1000 steps after 
sufficient Monte Carlo steps (MCSs). Under the same parameter 
setting, data were further averaged over 32 independent simulations. 
Pc is the equilibrium frequency of strategy C, whereas Pr is the 
equilibrium frequency of strategy R. Due to the disconnection of 
interactions, some agents are isolated without any connections to 
others. These independent agents were excluded in the computation 
of strategy frequencies. Without specification, the initial population 
consisted of 2500 agents whose strategies were random, and the 
initial interaction network was a periodical square lattice {{d) = 4). 
Following common practice, a middle level of imitation noise {6 = 
0.1) was set. The default setting of the discounting rate of reputation 
is 0.5. 

Phase diagrams of the equilibrium frequencies of the behaviors. In 

all simulations, c = 1. Therefore, the partner- switching behavior 
presumably falls within the range of 0 < c^^ < 1. When Cr= 1, the 
partner- switching behavior becomes as costly as the cooperative 
behavior. Previous literature has shown that the average vertex 
degree (d) is an important factor for the evolution of cooperation 
in networked games^^. Here, I adopt a normalized amplification 
factor of PGG rj = (x/({d) + 1). In Fig. 1, the simulation results are 
shown under two sets of simulations. The top panel shows the 
situation at a low partner- switching probability (Pr = 0.2), 
whereas the bottom shows the situation with the same parameter 
values as the top panel except Pr = 0.8. 

In Fig. 1(a), pc increases with increasing r]. A higher level of Cr 
increases the critical value rj'c for r], above which cooperators coexist 
with defectors. However, such an effect is weak. In Fig. 1(b), the phase 
of Pr exhibits a nonlinear transition. Pr does not always increase with 
increasing ?]. On the whole parameter plane, Pr does not reach 70%. 
In the parameter range of 0 < < 0.6, p^^ decreases sharply to 0 with 
increasing Cr; that is, the reputation-based partner- switching beha- 
vior is sensitive to its cost. This result corresponds with the work of 
Suzuki and Kimura. However, the decrease in Pr becomes gentler at 
higher levels of ?]. Let us analyze the situation at Pr = 0.8. The critical 
value r]'c for the emergence of cooperation is lowered remarkably. In 
Fig. 1(d), the nonlinear phenomenon is more obvious than that in 
Fig. 1(b). In the phase where pc experiences a transition, Pr reaches 
the maximum. In other words, the evolution of cooperation and the 
evolution of reputation-based partnership adjustment might be 
correlated. 

Coevolution under the improved reputation function. Compared 
with Eq. 4, the improved function of reputation (Eq. 5) can evaluate 
individuals' behaviors with respect to both game playing and partner 
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Figure 1 | The effects of the cost of partner switching on equiUbrium strategy frequencies. Top panel: the partner-switching probabiUty Pr = 0.2; 
bottom panel: Pr = 0.8. In each panel, the frequency of cooperative strategy pc and the frequency of reputation-based partner- switching strategy are 
shown as a function of both the normalized amplification factor and the cost of partner switching Cj?), respectively. In (b), Pr decreases to 0 rapidly with 
increasing Cr in the parameter range of 0 < < 0.6. When vj becomes larger, the evolution of partner-switching behavior is facilitated. In (d), there is no 
phase where Pr rapidly decreases to 0 with increasing Cr. 



switching. Fig. 2 shows the equilibrium frequencies in the situation 
under Eq. 5. Figs. 2 and 1 can be compared directly because they 
share the same parameter and simulation settings. Fig. 2(a) looks 
quite similar to Fig. 1(a). In Fig. 2(c), pc experiences a slower 
transition from 0 to the maximum. The phase diagrams of 
partner- switching behavior under the improved rule display more 
interesting results. Pr is much higher in Fig. 2(b) than in Fig. 1(b). In 
Fig. 2(d), is increased on average as well. The enhancement of the 
frequencies of partner- switching behavior can be attributed to the 
improvement of reputation evaluation. In Fig. 2(d), Pr does not vary 
significantly with the variation of Cr. The influence of the cost 
associated with partner- switching behavior is trivial when agents 
can perform such behaviors more frequently. Moreover, the 
correlation between cooperation and reputation-based partnership 
adjustment can also be observed. Before stepping into the next 
analysis, we can conclude that the prevalence of reputation-based 
partner- switching behavior is aided by some refined rules of 
reputation evaluation. 

The phase diagrams of strategy frequencies with fixed normalized 
amplification factors of PGG are depicted in Fig. 3. Two cases of 
normalized amplification factors are studied: 0.7 and 0.9. The former 
setting allows a phase transition of po whereas the latter setting 
enables a high-level pc- At the same level of Cr, pc does not increase 
monotonously with Pr, and neither does Pr. Remember that a ran- 
dom agent reconnects to a new one with probability Pr or otherwise 
updates his strategy. Hence, an optimal partner-switching probabil- 



ity for the emergence of prosocial behaviors is likely to be a medium 
value. In the bottom panel, the optimal value of Pr shifts toward 0. 
Empirical research has shown that intermediate levels of change in 
social ties lead to optimal levels of cooperation in a series of online 
experiments^^. The simulation result of this model echoes the empir- 
ical finding. 

As a complement to phase diagrams of the C and R traits. Fig. 4 
shows equilibrium frequencies of the CR, CN, DR and DN strategies 
as a function of the cost of partner switching, the amplification factor 
and the partner- switching probability, respectively. Fig. 4(a) shows 
that, with increasing Cr, the equilibrium frequency of the CR strategy 
PcR decreases. Let us analyze the situations where the equilibrium 
frequencies of prosocial behaviors experience a phase transition from 
0 to certain levels. In Fig. 4(b), pcR increases with increasing ?]. 
Meanwhile, poN decreases with increasing ?]. When the social pro- 
ductivity is at a fixed level, as shown in Fig. 4(c), the equilibrium 
frequency of the CR strategy does not increase monotonously with 
the partner- switching probability. A high frequency of partnership 
adjustment might not lead to an optimal social status, which is in 
accord with the analysis of Fig. 3. In both Figs. 4(b) and (c), the CN 
strategy is promoted weakly. Therefore, the CR strategy has an evolu- 
tionary advantage over the CN strategy under the improved repu- 
tation function. Moreover, it can be seen that the DR strategy is 
suppressed in the coevolution. 

Fig. 5 is devoted to investigating the effect of a high partner- 
switching probability on the coevolution of cooperation and 
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Figure 2 | The effects of the cost of partner switching on equiUbrium strategy frequencies under an improved reputation evaluation rule as described in 

Eq. 5. Top panel: partner-switching probability Pr = 0.2; bottom panel: Pr = 0.8. In each panel, the frequency of cooperation pc and the frequency of 
reputation-based partnership adjustment Pr as a function of both the normalized amplification factor and the cost of partner switching (rj, Cr) are 
presented, respectively. The improved rule takes the prosociality of reputation-based partner-switching behaviors into account in the evaluation of 
reputation. Compared with Fig. 1, such a rule enhances Pr significantly. 



reputation-based partnership adjustment. For visual convenience, a 
population consisting of 256 agents was set. The simulations were 
performed using high values for Cr and rj. The situations at Pr = 0.2 
and Pr = 0.8 were compared. I sampled the detailed partnership 
networks and investigated the strong effect of partner switching over 
network structures. In Fig. 5, different agents are colored differently. 
Clearly, more separate subnetworks and isolated nodes were 
developed in the situation at Pr = 0.8. Agents that are segregated 
from the most highly connected subgraph have less of a chance to 
imitate prosocial agents. Therefore, a high-level partner- switching 
probability might not be good for the evolution of either cooperative 
or reputation-based partner-switching behaviors. 

The investigation of reputation-based reciprocity. Indirect reci- 
procity is unique in that one individual has helped another and 
therefore is helped by a third one. Such a reciprocity pattern pivots 
on reputation, which is called "reputation-based reciprocity" or 
"downstream reciprocity"^. Whether the coevolution of prosocial 
behaviors in this model is ascribable to reputation-based recipro- 
city was further investigated. The formation of new links manifests 
the basic pattern of reputation-based reciprocity. Consider a new link 
reconnecting to a cooperator: the more likely a cooperator is at the 
other end, the stronger the effect of reputation-based reciprocity. A 
proper measure to quantify this effect in a period of evolutionary 



time is the ratio of the cooperator-cooperator links to the total 

^cc 

number of links reconnecting to cooperators — . 

he + foe 

Two simulations were run with the same parameters except for 
different amplification factors, which resulted in two typical coevo- 
lution processes with distinct outcomes, as shown in Fig. 6. - — — — , 

he + foe 

Pc and Pr are displayed as a function of evolutionary time step, and 
the time window for sampling was 1000 MCSs. Fig. 6(a) displays an 
evolutionary process in which a prosocial regime emerged success - 

he 

fully, whereas Fig. 6(b) reports a negative example. — exhibits 

he + foe 

remarkably different levels in these two processes. When pc and Pr 

he 

experience a transition to high levels, as shown in Fig. 6(a), — 

he + foe 

exhibits high levels transiently. Additionally, there is a period of time 

he 

when — is mostly 100%. When the coevolution arrives at a 

he + foe 

stationary stage, there are no new links reconnecting to cooperators. 
This comparison analysis indicates that a high level of social produc- 
tivity creates a niche for reputation-based reciprocity, which allows 
its functionality in the achievement of high frequencies of prosocial 
behaviors. Thus, the basic pattern of reputation-based reciprocity 
underlies the coevolution in this model. 
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Pr Pr 

Figure 3 | The effects of the cost of partner switching on equiUbrium strategy frequencies with fixed normalized amplification factors of PGG. 

Top panel: the normalized amplification factor rj = 0.7; bottom panel: rj = 0.9. In each panel, the equilibrium frequency of the cooperative behavior pc 
and the equilibrium frequency of the partner-switching behavior as a function of both the partner-switching probability and the cost of partner 
switching (Pr, Cr) are displayed, respectively. The average degree of the partnership network (d) = 4. 



The effects of other factors. So far, I have investigated the principal 
factors that condition the coevolutionary dynamics under study. 
Finally, the effects of reputation decay and the average number of 
partners are examined. I report the equilibrium frequencies as a 
function of Cr under different combinations of Pr and the 



discounting rate of reputation X in Fig. 7. Note that previous 
results were obtained from simulations at X = 0.5. When the 
partner- switching probability is small (Pr = 0.2), different repu- 
tation decay rates have little influence on the equilibrium strategy 
frequencies. However, a full memory of previous reputation scores 
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Figure 4 | The equilibrium frequencies of four strategies as a function of the cost of partner switching Cr, the normalized amplification factor j/ and the 
partner-switching probability Pr, respectively, (a) rj = 0.7, Pr = 0.2; (b) Cr = 0.5, Pr = 0.2; (c) Cr = 0.5, f] = 0.7. The data were sampled during the 
simulations for Figs. 2 and 3. In (a), pcR decreases with increasing Cr, whereas px)iv increases with increasing Cr. Subfigures (b) and (c) show the 
situations where the cost of partner switching is fixed. At a given partner-switching probability, the CR strategy becomes the fittest as rj increases. With a 
fixed rjy the highest frequency of CR occurs at a middle value of Pr. 
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Figure 5 | An analysis of the strong effect of partner switching on the coevolution of cooperation and reputation-based partnership adjustment, (a) An 

interaction network sampled when the population reached the stationary regime at partner- switching probability Pr = 0.2; (b) An interaction network 
sampled when the population reached the stationary regime at Pr = 0.8. Blue color denotes the CR strategy; yellow denotes the CN strategy; green denotes 
the DR strategy; and red denotes the DN strategy. The total number of agents is 256. A high frequency of partner switching induces more separate 
subnetworks and isolated vertices. Other parameter values are the cost of partner switching Cr = 0.96, the normalized amplification factor rj = 0.98 and 
the average degree of the partnership network (d) = 4. These pictures were drawn in Pajek^^. 



= 1) affects the coevolution when the partner- switching 
probability increases. At Pr = 0.4, 0.8, the trajectories for >^ = 1 
are lower than the trajectories for = 0 or 0.5. 

The simulation result with (d) = 8 is displayed in Fig. 8. The 
parameters are the same as those of the top panel of Fig. 2. 
Remarkably, both the critical value of f] for cooperation and the 
critical value of rj for partner switching are lowered. Additionally, 
the transitional phases of both pc and Pr from 0 to high levels 
become narrower compared with Fig. 2. When interaction patterns 
are fixed, a higher average vertex degree usually makes the evolution 
of cooperation less likely. However, it is much different in situations 
where agents dynamically adjust their social relationships. 

Discussion and Conclusions 

Reputation can steer individuals' practice of partnership adjustment, 
resulting in a selection force against defectors. Fu et at. proposed a 



model that describes the coevolution of game playing and partner 
switching^^. At each time step, a random individual can either alter 
his strategy by imitating his nearest partners or adjusts his partner- 
ships by severing the tie to the neighbor with the lowest reputation 
and reconnecting to a new partner. They found that reconnecting to 
the next-nearest neighbor who has the highest reputation is more 
beneficial to cooperation than random adjustment. Notwithstanding 
the inheritance of reputation-based partnership adjustment, this 
model advances on previous ones in two key respects. First, the 
individual's partner- switching behavior is governed by an allele 
and is heritable in this model; namely, an agent does not have a fixed 
partner- switching preference during his evolutionary life. Second, 
individuals who are willing to adjust their partnerships have to pay 
a cost for evaluating reputation and broadcasting information. These 
assumptions allow us to investigate the conditions for the evolution 
of costly partner- switching behaviors in a reputation system. By 
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Figure 6 | The manifestation of the basic pattern of reputation-based reciprocity. Two parallel simulations were sampled: (a) A high level of 
reputation-based reciprocal effects allows the prevalence of prosocial behaviors (the normalized amplification factor y] = 0.9); (b) A low level of 
reputation-based reciprocal effects leads to the extinction of prosocial behaviors = 0.5). The basic pattern of reputation-based reciprocity is rooted in 
the formation of new links. The level of reputation-based reciprocity is measured by the ratio of the cooperator-cooperator links to the total number of 
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links reconnecting to cooperators, — , in a period of evolutionary time. For both cases, other parameters are the partner-switching probability 

t-CC + t-DC 

Pr = 0.2 and the cost of partner switching Cr = 0.5. 
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Figure 7 | The effects of the discounting of reputation score on the coevolution of cooperation and reputation-based partnership adjustment. 

(a) The C frequency pc as a function of the cost of partner switching Cr under different combinations of the partner- switching probability Pr and the 
discounting rate of reputation /i; (b) The R frequency as a function of Cr under different combinations of Pr and /I. As shown, the discounting of the 
reputation score has a weak influence over the strategy frequencies. The normalized amplification factor rj = 0.7. 
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Figure 8 | Phase diagrams of the equilibrium strategy frequencies when agents have more partners on average. The initial population structure was a 
square lattice with a Moore neighborhood, i.e., the average degree of the partnership network (d) = 8. Top panel: the partner- switching probability Pr = 
0.2; bottom panel: Pr = 0.8. In each panel, the C frequency pc and the R frequency Pr as a function of both the normalized amplification factor and the 



cost of partner switching (rj, Cr) are presented, respectively. Other parameters are the same as those of Fig. 2. Comparing Fig. 8(b) with Fig. 2(b), the 
parameter range of rj where Pr decreases rapidly with increasing Cr becomes narrower. 
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numeric simulations, I obtained two-dimensional phase diagrams 
for game- strategic behaviors and partner- switching behaviors in a 
wide parameter range. I show that partner- switching behaviors can 
evolve even in the presence of a certain cost. There is a concurrence 
between the evolution of cooperative behaviors and the evolution of 
partner- switching behaviors, implying a mutual enhancement effect 
between these two prosocial behaviors. 

In an analytical model, Suzuki and Kimura showed that selfish 
individuals never build a reputation and that indirect reciprocity fails 
once reputation building is costly^^. At a high level, the present results 
are consistent with the prior work in that the emergence and susten- 
ance of reputation-based partner-switching behaviors are sensitive to 
their costs within a certain parameter range. However, such a han- 
dicap can be overcome with a large amplification factor of social 
productivity or a high probability of partner switching. Moreover, 
an improved rule of reputation evaluation counting the score of the 
partner- switching behavior significantly promotes its evolution, 
which correspondingly facilitates the evolution of cooperation. 
This result highlights the importance of reputation design in solving 
free-riding in other prosocial behaviors. In real societies, the archi- 
tects of a social system should become aware of the power of policy 
design, which can channel the individual's interest toward the 
achievement of prosociality. Notably, the partner- switching models 
are different from typical models of indirect reciprocity in that there 
is an underlying interaction structure. Although the interaction net- 
work is dynamic with respect to a large time scale, agents transiently 
have relatively fixed partnerships. As is discussed in Suzuki and 
Kimura's article, network reciprocity may support the evolution of 
cognitive abilities to build reputations and thus indirect reciprocity. I 
show that in a dynamical social network, partner- switching beha- 
viors based on reputation can evolve regardless of their costs. 

More generally, these results might be extended to the case of 
assortative matching or homophily, i.e., where individuals with sim- 
ilar genotypes, phenotypes or strategies are more likely to interact 
with each other^^'^^. Assortative partner choice promotes cooperation 
in human experiments adopting both a PGG^^'^^ and a prisoner's 
dilemma game^°. When humans can choose the subjects with whom 
they wish to interact, partner choice creates the possibility of altruism 
provided that individuals compete for good partners^\ In fact, the 
partner choice theories have been tested in some biological environ- 
ments, e.g., cleaner mutualism among fish^^. A theoretical model 
reveals that homophily can evolve under a wide variety of condi- 
tions^^ and a genome-wide analysis of correlation in genotypes 
between human friends suggests that homophily may yield an evolu- 
tionary advantage^^. Although the cost is a byproduct of the behavior 
of assortative matching or homophily, those self-evolved agents 
might develop an ability to distinguish their counterparts. Never- 
theless, this model is primitive, and some interesting refinements 
could be made. For example, partner-switching behavior is defined 
here as a binary value. A more realistic assumption is that an agent 
exhibits this behavior with a probability that is a continuous value 
and endogenously determined based on the agent's own interest. 
Furthermore, harnessing the free riding of public goods under stric- 
ter conditions remains to be explored. It is pivotal that a cooperator 
can end the mutual relationship between himself and a defector in 
this model. In the case of nonexcludable public goods, however, one 
cannot exclude others' participation in his game; an individual can 
merely adjust his own participation^^. Once a cost is associated with 
such partner- switching behaviors, a reputation mechanism might 
be redesigned, or some other mechanisms may play a role as a 
complement. 

The building blocks of humanity are a variety of prosocial beha- 
viors with a prominent feature of cost. I show that a regime where 
cooperative and partner- switching behaviors persist collectively can 
be sustained on the conditions that reputation mechanisms are well 
designed and that some parameters are tuned. This work is beneficial 



to understanding the conundrum that humans are willing to involve 
in costly but prosocial behaviors regarding reputations. 

Methods 

As mentioned above, the analysis is based on a coevolution model of cooperation and 
partnership adjustment. Each agent has two hypothetical alleles which determine his 
behaviors for game playing and partner switching. In the initial stage of a simulation, 
agents are assigned with random allelic types and the partnership network is a square 
lattice with periodic boundary conditions. Each agent can host a PGG in which his 
linked neighbors can participate. Agents adjust their partnerships or update their 
strategies asynchronously in a randomized sequential order. The simulation 
algorithm consists of the following phases: 

Phase 1 (Initialization). A population is generated and the simulation parameters 
are initialized. 

Phase 2 (Coevolution). Steps 2.1-2.3, which constitute a complete MCS, are 
repeated until the coevolution reaches a steady state. 

Step 2.1 (Game interaction). The fitness of each agent is set to zero. Each agent plays 
PGGs with its linked neighbors and obtains payoffs. 

Step 2.2 (Reputation evaluation). Each agent's reputation score is renewed 
according to Eq. 4 or 5. Each agent who exhibits the R trait has to pay a certain cost. 

Step 2.3 (Update dynamics). A random agent adjusts his partnership with a certain 
probability (Action 2.3.A) or otherwise updates his strategy (Action 2.3.B). 

Action 2.3.A. The focal agent severs the tie to the neighbor with the lowest repu- 
tation and reconnects to the next-nearest neighbor who has the highest reputation if 
he exhibits the R trait. 

Action 2.3.B. The focal agent changes his strategy by imitating one of his neighbors 
under a probability function, i.e., Eq. 3. 

The equilibrium frequencies of strategies for one simulation run result from 
averaging over 10^ MCSs after a transient period of more than 10^ MCSs. This 
procedure was repeated 32 times to produce each data point in Figs 1, 2, 3, 4, 7, and 8. 
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